Hand shape Coding for HMM-based Consonant Recognition in Cued Speech for French

نویسندگان

  • Noureddine Aboutabit
  • Panikos Heracleous
  • Denis Beautemps
چکیده

Cued Speech (CS) is a visual communication mode that makes use of hand shapes placed in different positions near the face in combination with the natural speech lipreading, to enhance speech perception from visual input. This system is based on the motions of the speaker’s hand moving in close relation with speech. In a CS system, hand shapes are designed to distinguish among consonants and hand placements are used to distinguish among vowels. Due to the CS system, both manual and lip flows produced by the CS speaker carry a part of the phonetic information. This contribution presents automatic hand shape coding of a CS video recording with 92% obtained accuracy, and multistream hidden Markov models (HMMs) fusion to integrate hand shape and lip shape elements into a combined component and perform automatic recognition of CS for French. Compared with using lip shape modality alone, by applying fusion the accuracy of CS consonant recognition was raised from 52.1% to 79.6%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cued Speech automatic recognition in normal-hearing and deaf subjects

This article discusses the automatic recognition of Cued Speech in French based on hidden Markov models (HMMs). Cued Speech is a visual mode which, by using hand shapes in different positions and in combination with lip patterns of speech, makes all the sounds of a spoken language clearly understandable to deaf people. The aim of Cued Speech is to overcome the problems of lipreading and thus en...

متن کامل

A HMM recognition of consonant-vowel syllables from lip contours: the cued speech case

Cued Speech (CS) is a manual code that complements lipreading to enhance speech perception from visual input. The phonetic translation of CS gestures needs to combine the manual CS information with information from the lips, taking into account the desynchronization delay (Attina et al. [1], Aboutabit et al. [2]) between these two flows of information. This paper focuses on HMM recognition of t...

متن کامل

Cued speech recognition for augmentative communication in normal-hearing and hearing-impaired subjects

Speech is the most natural communication mean for humans. However, in situations where audio speech is not available or cannot be perceived because of disabilities or adverse environmental conditions, people may resort to alternative methods such as augmented speech. Augmented speech is audio speech supplemented or replaced by other modalities, such as audiovisual speech, or Cued Speech. Cued S...

متن کامل

A pilot study of temporal organization in Cued Speech production of French syllables: rules for a Cued Speech synthesizer

This study investigated the temporal coordination of the articulators involved in French Cued Speech. Cued Speech is a manual complement to lipreading. It uses handshapes and hand placements to disambiguate series of CV syllables. Hand movements, lip gestures and acoustic data were collected from a speaker certified in manual Cued Speech uttering and coding CV sequences. Experiment I studied ha...

متن کامل

Characterizing and classifying cued speech vowels from labial parameters

As part of the THIMP project (Telephony for HearingIMpaired People), we aim at automatically analyzing Cued Speech [1] and translating it into oral spoken language. This work focuses on vowel classification and will be part of this transcoding process as a preprocessing step of the input data analysis. Its objective is to identify vowels produced by a speaker pronouncing and coding in Cued Spee...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013